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Abstract 

Online communities are becoming increasingly important as platforms for large-scale human cooperation. 
These communities allow users seeking and sharing professional skills to solve problems collaboratively. 
To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze 
StackExchange, one of the largest question and answer systems in the world. We construct attention 
networks to model the growth of 110 communities in the StackExchange system and quantify individual 
answering strategies using the linking dynamics of attention networks. We identify two types of users 
taking different strategies. One strategy (type A) aims at performing maintenance by doing simple tasks, 
while the other strategy (type B) aims investing time in doing challenging tasks. We find that the number 
of type A needs to be twice as big as type B users for a sustainable growth of communities. 


Introduction 

Humans are unique in their ability to create public goods in non-repeated situations with non-kin. In 
larger groups cooperation is more difficult due to the higher temptation to free ride on the voluntary 
contributions of others [^. Nevertheless humans are able to create pnblic goods with thousands and even 
millions of unrelated individuals. For example, there are an increasing number of online communities 
where participants put in time and effort to make voluntary contributions such as street maps , software 
[^, encyclopedic information [^, protein folding |^, and language translation [^. 

Online communities are natural experiments that give us an opportunity to test possible mechanisms 
that explain cooperation in large groups. Controlled online experiments show that if participants can 
choose group members higher levels of cooperation can be derived [^. This suggests that assortment is 
a sufficient condition to derive cooperation in large groups. However, such experiments have a duration 
of about an hour in which participants are all simultaneously online and are recruited with the promise 
of monetary payments. Whether this scales up to large groups over longer periods of time is an open 
question. 

We will demonstrate in this paper that assortment is not sufficient to derive high levels of contributions 
in online collaboration [^. Our analysis shows that at least two different types of strategies of making 
voluntary contributions are needed to sustain an online community over a longer period of time. One 
strategy (type A) aims at performing maintenance by doing simple tasks, while the other strategy (type 
B) aims investing time in doing challenging tasks. We cannot measure the motivations for those two 
strategies, but we hypothesize that the first may related to reputation in the broader community, and 
the second to intrinsic motivations and reputation among peers. 

For our empirical analysis we investigate the answering records of nearly three million users over a 
period of six years from 110 online communities. We find that type A users are important in cutting down 
the median waiting time for answers, while type B users help increase the acceptance rate of answers. 
The comparison of overall size across the studied communities suggests that a ratio equals 3:2 between 
type A and B users is preferred for bigger communities. We use these empirical finding to build an 
“attention” network model in which we formalize the strategies depicted above as linking dynamics. In 
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an attention network model the nodes are questions and the edges are successive answering activities 
connecting two questions. This model allows us to analyze the effect of individual answering behavior 
on the growth of communities. Our analysis not only supports the existence of a trade-off between 
the two types of users, but also predicts what will happen when the ratio in a community deviates 
from the optimal ratio detected. We predict that a community containing too many type A users lacks 
high quality answers, thus making it difficult to attract new questions continuously. On the contrary, 
a community composed of too many type B users has many high quality questions, but it will attract 
more new questions than it can handle. In sum, a balance between the two types of users is necessary 
for the sustainable growth of communities. At the end of the paper, we select three communities to 
illustrate the predictions of our attention network model, including “math.stackexchange.com” (which 
has an optimal ratio), “astronomy.stackexchange.com” (which contains too many type A users), and 
“electronics.stackexchange.com” (which contains too many type B users). 


Mixing Strategies and the Sustained Growth of Communities 


Figure depicts the profiles of the two types of users. Type A users prefer the easier, newer questions 
and have an answer acceptance rate higher than type B users, who have a tendency to answer the more 
difficult, older questions, across all expertise levels. The ANOVA tests on all the four variables between 
the two groups are significant (Figure [^B ~ C). 

It is natural to ask whether the ratio between type A and B users has an effect on the overall 
performance of communities. Two important indicators concerning the performance of Q&A communities 
are the median waiting time for answers and the overall acceptance rate of answers [9 10 . The waiting 


time is defined as the time interval between a question being posted and a satisfying answer being 
accepted. Median is used instead of mean, because the distribution of waiting time comprises a few 
extremely large values that will lead to biased results if using the mean value [^. The accepted answer 
rate is the fraction of the questions with an accepted answer over the total population of questions. A 
good community is expected to have a high acceptance rate and a short median waiting time [9 10 . 

In Figure]^ we calculate the fraction of type A users a {a £ [0,1]) in each of the 110 communities 
and plot the two discussed indicators against a. It turns out that type A users contribute to the waiting 
time and type B users contribute to the accepted answer rate. Therefore, a balance between these two 
types of users should be carefully chosen in order to optimize the community performance. In Figure 
[^C) we find that the maximum community size (“stackoverflow.com”) is achieved when a approximates 
0.63, i.e., the ratio of type A users to type B users is 3:2. In Figure [^A) we plot the growth curves of 
all communities and color them based on their derivation from the optimal a. This figure shows clearly 
how either too many type A users or two many type B users leads to an unsustainable growth rate. 


Prom Answering Strategies to Linking Dynamics 

We have shown a co-occurrence between an optimal ratio and the maximum community size. However, 
why a deviation from the optimal ratio is related to small communities is still unclear. Therefore, we 
use a network model to formalize our assumptions of individual answering strategies and to analyze the 
consequences of different ratios. 

We define attention networks to represent Q&A communities, in which nodes are questions and edges 
are the sequential answering activities of users. In attention networks, an answering strategy based on the 
number of the existing answers to questions can be interpreted as a degree-based rule of link increases. 
As type A users prefer easy questions (low-degree nodes) and type B users favor difficult questions (high- 
degree nodes), when these two types of users respond to a new question, they carry links from very 
different nodes. Therefore, we can simplify the model by assuming that type A strategy corresponds to 
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the rule of “preferential attachment” 11 


in which the rich get richer, and type B strategy corresponds 
in which the attractiveness of a node decreases 
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to the reversed process of “preferential attachment” 
with its degree. The reversed “preferential attachment” process is usually used to describe resource-based 
competition between nodes in flow networks such as food webs 
network 


13 , power grids 14 


15 . For example, in food webs an outbound edge transport resources from a 


and the airport 
prey” node to a 

“predator” node. If several predators are fed on the same prey, then the supplied resources have to be 
split and shared, thus decreasing the attractiveness of the prey node. We argue that this effect also exist 
in attention networks in which questions are competing for the limited attention of users. 

We use / and 1 — / (/ €[0,1]) to represent the probability of observing type A and B strategies, 
respectively (note that / is different from the empirical value of a mentioned in the last section, as a is 
not the fraction of activities but the fraction of users), and quantify the probability p{k) of a new question 
being connected to a pre-existing similar question of degree k as 


p{k) = f 




( 1 ) 


As introduced, in two extreme cases this model degenerates to the “preferential attachment” model 
(/ = 0) and the “reversed preferential attachment” model (/ = 1) , respectively. Using the master 


equation technique 16 we derive that the tail of the degree distribution will converge to 


Pk 


= k' 


( 2 ) 


in which the power exponent a has a minimum value 3 and always increases with / (see SI for the 
details). We find that for a majority of communities the empirical value of a lies in the scope of [3,5] 
(Figure ivi). supporting our derivation. As a larger power-law exponent implies higher equality in 


resource distribution 17 , our model suggests that type A strategy equalizes the allocation of attention 


(edges) among nodes and increases the chance of a new question being answered. 

Besides degree distribution, the discussed linking rules also explain several scaling relationships ob¬ 
served in the growth dynamics of attention networks as presented by Figure]^ and mentioned in 18 19 


Users are more likely to post questions when they search the Web and find that a similar question has 
obtained many answers but their concerns have not been fully addressed. As a consequence, a new 
question is more likely to be added to the network if the existing similar questions have more answers. 
To include this process in our model, we consider node replication as the main driving force underlying 
network growth and allow high-degree nodes to generate more new nodes. Considering the node-matching 
probability p{k) given by Eqlij we can calculate that the expected attractiveness of a single nodes is 


E{k) = £ kp{k) ~ N- 


( 3 ) 


k=l 


Therefore, the expected number of new nodes generated by an existing node is E{k)N^, in which we use 
to model the effect of network size. By summing E{k)N^ over all nodes in the network we obtain 
the total number of new nodes as AiV = A(fc)A®+^. Substituting this condition into Eq. [^we derive the 
scaling relationship between the number of new and old nodes 


AiV- 


N'^ = iV^+®. 


( 4 ) 


Note that if an old node generates many new nodes, then there will be a stronger competition between 
these new nodes for edges. As a result, the cost of linking to an existing node is proportional to its 
degree 12 . Meanwhile, it is reasonable to assume that new questions cannot obtain an infinite number 
of answers but have a limited “quota” that approximates constant C. Putting these two conditions 
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together, we derive the expected number of links obtained by a new node as Am = CN^/E{k), in which 
is the effect of network size. Using the conclusion of Eq. we have 

Am-(5) 

From Eq. and Eq. we can derive the scaling relationship between the number of new edges and new 
nodes: 

AM = AmAN - AN'^ = ANv+^ = AN 3-/+2« (6) 

To summarize, the above analysis explains why a mixture of different strategies is crucial to the 
sustainable growth of communities. On one hand, Eq. suggests that a community should have more 
type A users to maintain the number of answers per question (larger J); on the other hand, Eq. [^suggests 
that a community should have more type B users to attract new questions (larger rj). As a balance, Eq. 
[^predicts that an optimmal fraction of type A users, / = 3 + 2g, is preferred in order to maximize the 
value of 7 , i.e., to match the growth of answers with the growth of questions. We argue that / as the 
fraction of behavior can be viewed as the multiplication between two variables, the fraction of users a 
and answering frequency distribution w. As ic is always positive, / changes in the same direction as 
a. Therefore the derived optimimal value of / implies that there is also an optimal value of a, which is 
consistent with our empirical observation. 


Examples of Successful and Less Successful Communities 

Three communities are selected to compare the consequences of different ratios between type A and 
B users (Figure]^, including a community for math questions (math.stackexchange.com, or MATH in 
short), a community for questions about astronomy (astronomy.stackexchange.com, or ASTR), and a 
community for questions about game development (gamedev.stackexchange.com, or GAME). As given 
by the first column in Figure ASTR and GAME are not as successful as MATH in maintaining a 
sustained growth curved, and this be explained by our model. 

The fraction of type A users in MATH is approximately 0.63, which is equal to the optimal value. In 
contrast, ASTR has more type A users (a = 0.67). According to our model, questions in ASTR will be 
responded to efficiently, but there will be a slow growth of new questions. This prediction is confirmed 
by Figurej^, which shows that the average number of answers to new questions is increasing, and Figure 
HP. which shows that the increase of new questions slows down as time goes on. Meanwhile, GAME has 
a few more type B users (a = 0.62) than the optimum. According to our model, in this community new 
questions will increase so fast that they cannot be answered quickly. The fast increase of new questions 
is evidenced in Figure]^, and the shrinking budget of answers per question is observed in Figure [^. It 
is interesting to note that ASTR is one of the most traditional scientific areas and GAME is a new, fast 
developing area due to the widespread use of smart phones. We argue that neither simply being classic 
nor being hot would naturally lead to sustained growth, instead, the sustained growth of a community 
comes from the careful balance between contributors of diverse preferences. 


Conclusions and Discussions 

We look at online communities as natural experiments for collective action problems. Our results imply 
that assortment is not sufficient to derive high levels of contributions in massive collaboration. Instead, 
strategic diversity seems to be the key for sustainable online communities. In the Stack Exchange datasets, 
a mixing ratio of 3 : 2 between two types of users is found to maximize the size of communities. Type A 
users have a tendency to answer easier, newer questions and type B users prefer to answer more difficult, 
older questions. We propose an attention network model to formalize the two answering strategies of 
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users and to explain the existence of an optimal ratio. Our conclusion is that type A users contribute to 
the number of answers and type B users contribute to the quality of answers, thus both of them is crucial 
to the development of communities. 

Our work generalizes the models of Barabasi et al. and Sevim et al. to study large-scale 
cooperation in online communities. The present analysis on attention networks can also be applied to 
model a variety of other online collective behaviors such thread browsing photo tagging 21-23 , and 


news sharing 24 . The current study also has limitations, which point out the future direction of research. 


For example, to obtain a simple math model we simplify the rich behaviors of users and only consider two 
extreme strategies. Meanwhile, in attention networks we naively assume that the ratio between the two 
types of users is a constant throughout the evolution of a community. In future studies we may consider 
a ratio that changes over time. 


Materials and Methods 

Data sources 

Stack Exchange is a network of question and answer communities covering diverse topics in many 
different fields. We downloaded its database dump on January, 2014 from https: //archive. org/ 
details/stackexchange, which contains the log files of 110 communities. The smallest community 
italicui.stackexchange.com was created in November, 2013 and has 374 users, 194 questions, and 387 
answers in our data set. The largest site stackoverflow.com (SO) was created in July, 2008 and has 
2,728,224 users, 6,474,687 questions and 11,540,788 answers in our data set. 


Measuring Question DifRcnlty and User Expertise 


We use the number of answers as a proxy for the “perceived difficulty” of questions 25 in order to divide 


users according to their difficulty preferences. We firstly count the number of existing answers qij to a 
question j when a user i responds to it. Then we average this number over all m questions answered by 
a user to derive his/her average level of difficulty preference A Qij- After that, we use the grand 

mean of difficulty preferences in a community containing n users, which is the 

threshold to separate type A users (whose preference is smaller than the threshold) from type B users 
(whose preference is greater than the threshold) (FigureA). To validate the difference between the two 
types of users, we compare the other two variables, including the average age of answered questions and 
the acceptance rate of submitted answers (Figure [^C ~ D). 

The TrueSkill algorithm 26 27 was applied to validate the estimation on users’ difficulty preferences. 


In the SO community we obtain the TrueSkill scores of 912,082 users and 3,771,021 questions (please 
see SI for the details), which represent the expertise levels of users and the “real difficulty” of questions, 
respectively [^. We find that type B users are always answering more difficult questions than type A 
users at all expertise levels (Figure[^B). In our opinion this result validates our division of the two groups. 
Furthermore, we find that the TrueSkill scores of users are positively correlated with their reputation 
points in the log files (Pearson coefficient p = 0.29, p-value < 0.001), justifying the credibility of TrueSkill 


scores. 


Constructing Attention Networks 

The answering strategies of individual users can be understood from a network perspective, in which 
two questions are connected if they are answered sequentially by the same users (see SI for the details). 
From this perspective, Q&A communities are growing networks with increasing nodes (questions) and 
links (answers). We call them “attention networks” because they show the transportation of collective 
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attention in solving problems. Attention networks translate answering strategies into linking dynamics; 
hence provide a quantitative, predictive model for us to explore the collective answering behavior of users. 

From empirical data we construct a growing attention network for each of the 110 communities. The 
network properties we are interested in include the cumulative number of nodes (N) and edges (M), the 
daily increments of nodes (AA^) and edges (AM), and the number of links per node (m = M/N) and its 
daily increments (Am = AM/AN). 
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Figure 1. Two opposite answering strategies in the SO data set. Each data point shown in the 
background is a user. For each user, we use his expertise score calculated from the TrueSkill algorithm 
as the a;-axis of the data point and use several user-level metrics as the y-axis. These metrics include 
the average number of competitors in answering questions (A), the average difficulty of answered 
questions (B), the average age of answered questions (C), and the overall acceptance rate of answers 
(D). We use the mean value of the first variable, the average number of competitors, as a threshold and 
separate users into two groups, type B users and type A users. The fraction of type A users is 0.63 in 
the SO dataset. In all figures, we plot the linear-binned data of the two groups for comparison [^. We 
find that on average, type A users (triangles) tend to choose newer and easier questions and have higher 
answer acceptance rates than type B users (circles). See online version for color display. 
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Figure 2. The effect of the fraction of type A users on the performance of communities. We find that 
both of accepted answer rate (B) and median waiting time (C) decrease with the increase of type A 
users. As a good community is expected to have a high accepted answer rate and a short waiting time 
of answers [^[^, our finding suggests that a balance between the two types of users should be carefully 
chosen in order to optimize the performance. In (B) the regression coefficient is —0.48 and the p-value 
is 0.029. In (C) after removing the outlier (the data point in the gray circle) the regression coefficient is 
—352.89 and the p-value is 0.107. Figure (D) suggests that the maximum size of communities is 
archived when a « 0.63. Figure (A) shows the monthly increased number of questions and answers of 
110 sites, in which the upper bounds of bands show the number of answers and the lower bounds show 
the number of questions. These bands are plotted in different colors to shown their derivation from the 
optimal ratio a = 63%. It is observed that the larger deviation from the optimal ratio is related with 
more unsustainable growth. See online version for color display. 
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Figure 3. The scaling properties of communities. We select “math.stackexchange.com” (a = 0.63, the 
first row) as the example for sustainable growth and “gamedev.stackexchange.com” (a = 0.62, the 
second row) and “astronomy.stackexchange.com” (a = 0.67, the third row) as the examples for 
unsustainable growth. In (A), (E), and (I) we plot the monthly increased number of questions (the 
lower bounds of the bands) and answers (the upper bounds of the bands) against time. The rest of 
figures in the first three rows demonstrate the scaling behaviors of communities. In particular, the 
second column corresponds to Eq. the third column corresponds to Eq. and the fourth column 
corresponds to Eq. In these figures the OLS regression (gray) lines in logarithmic axes are plotted to 
demonstrate the estimated scaling exponents. We find that as predicted by our model, in 
“gamedev.stackexchange.com” new questions increase so fast (F) that they cannot be answered on time 
(G); while in “astronomy.stackexchange.com” questions are answered quickly (K) but the increase of 
new questions is very slow (J). The last row gives the distributions of the four scaling parameters across 
the 110 communities. The mean values of the parameters are 3.7 (a), -0.04 (rj), -0.08 (5), and 0.71 ( 7 ). 
See online version for color display. 















































